Benfords Law
(Operator Toolbox)
Synopsis
This operator tests, if columns in your data set are following Benford's LawDescription
Benford's Law states, that for real world applications often the distribution of leading digits of numbers is not uniform. Smaller digits (like 1 or 2) appear more often than larger digits. For more information please see: https://en.wikipedia.org/wiki/Benford's_law
This operator checks, if the first digit of the values ina numerical column follow this law. It first counts how often a leading digit occurs and calculates the frequency. This frequency is returned at the frequency port. Also the expected frequency according to Benford's law is reported.
In a second step a p-value is calculated. This value indicates how likely it is, that the observed distribution of leading digits is compatible with Benford's law. Low numbers mean, that the distribution is different to what's expected.
Input
exa (Data table)
The input example set
Output
frequency (Data table)
A table with the frequency of the first digit in the data set and the expected frequency if the data set would follow Benford's law.
pvalue (Data table)
The table with the pvalue ("probability") that a column follows Benford's Law
ori (Data table)
The original data set